Selection Algorithms

نویسندگان

  • Dorit Dor
  • Uri Zwick
چکیده

Given a set X containing n distinct elements, and given a number 1 i n, we would like to nd the element whose rank in X is exactly i, i.e., the element of X larger than exactly i 1 elements of X and smaller than the other n i elements of X. In particular we are interested cases where i = n for some 0 < < 1. The median of X is the dn=2e-th element. Improving a long standing result of Sch onhage, Paterson and Pippenger we show that the median (as well as every other element) of a set containing n elements can always be found using at most 2:9423n comparisons. We also describe an algorithm for selecting the n-th largest element, using at most (1 + (1 + o(1))H( )) n comparisons, where H( ) is the binary entropy function and the o(1) stands for a function that tends to 0 as tends to 0. For small values of this is almost the best possible as there is a lower bound of about (1 + H( )) n comparisons. We use some ideas from the median selection algorithm and we obtain a better selection algorithm for some intermediate values of (e.g., the dn=4e-th element can be found using only 2:69n comparisons). The underlying technique for our upper bounds is mass production. Mass production is carried out in factories which are originally used by Schonhage et al. in their 3n+ o(n) median algorithm. We introduce green factories and perform an amortised analysis of their production costs. On the other hand, we show a new lower bound for selection algorithms. We obtain a lower bound of (1 + H( ) + ( 4)) n o(n) comparisons for restricted selection algorithms. The known selection algorithms, including the ones in [BFP+73] and [SPP76] as well as the selection algorithms presented in this thesis, comply with this restriction. This bound gives in particular, a lower bound of 2:00247 n comparisons on median selection. We then extend this lower bound and obtain a lower bound of (2 + ) n comparisons for any median selection algorithm (for some > 0). Unfortunately, the improvement obtained for the general case is very small. We extend another result of Schonhage, Paterson and Pippenger [SPP76] and show that a conjecture of Yao implies that the n-th element can be found using at most (1 + log 1 +O( )) n vii comparisons (but does not give an explicit algorithm). This is even closer, but still above, the (1+H( )) n lower bound. We show, in particular, that if Yao's conjecture is true then the n-th element, where 1=3 1=2 can be found using at most (2 + ) n + o(n) comparisons. Yao conjectured that extra elements can not be used to obtain better selection algorithms. We describe an algorithm that selects the n-th element given (1 + )n elements (i.e., given n extra elements), whose complexity is better than the worst case complexity of any other known algorithm when 0:4711 < < 0:1724. This confronts Yao's conjecture with a concrete challenge.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sequential and Mixed Genetic Algorithm and Learning Automata (SGALA, MGALA) for Feature Selection in QSAR

Feature selection is of great importance in Quantitative Structure-Activity Relationship (QSAR) analysis. This problem has been solved using some meta-heuristic algorithms such as: GA, PSO, ACO, SA and so on. In this work two novel hybrid meta-heuristic algorithms i.e. Sequential GA and LA (SGALA) and Mixed GA and LA (MGALA), which are based on Genetic algorithm and learning automata for QSAR f...

متن کامل

Sequential and Mixed Genetic Algorithm and Learning Automata (SGALA, MGALA) for Feature Selection in QSAR

Feature selection is of great importance in Quantitative Structure-Activity Relationship (QSAR) analysis. This problem has been solved using some meta-heuristic algorithms such as: GA, PSO, ACO, SA and so on. In this work two novel hybrid meta-heuristic algorithms i.e. Sequential GA and LA (SGALA) and Mixed GA and LA (MGALA), which are based on Genetic algorithm and learning automata for QSAR f...

متن کامل

Comparison of particle swarm optimization and tabu search algorithms for portfolio selection problem

Using Metaheuristics models and Evolutionary Algorithms for solving portfolio problem has been considered in recent years.In this study, by using particles swarm optimization and tabu search algorithms we  optimized two-sided risk measures . A standard exact penalty function transforms the considered portfolio selection problem into an equivalent unconstrained minimization problem. And in final...

متن کامل

The project portfolio selection and scheduling problem: mathematical model and algorithms

This paper investigates the problem of selecting and scheduling a set of projects among available projects. Each project consists of several tasks and to perform each one some resource is required. The objective is to maximize total benefit. The paper constructs a mathematical formulation in form of mixed integer linear programming model. Three effective metaheuristics in form of the imperialis...

متن کامل

Negative Selection Based Data Classification with Flexible Boundaries

One of the most important artificial immune algorithms is negative selection algorithm, which is an anomaly detection and pattern recognition technique; however, recent research has shown the successful application of this algorithm in data classification. Most of the negative selection methods consider deterministic boundaries to distinguish between self and non-self-spaces. In this paper, two...

متن کامل

Application of Genetic Algorithms for Pixel Selection in MIA-QSAR Studies on Anti-HIV HEPT Analogues for New Design Derivatives

Quantitative structure-activity relationship (QSAR) analysis has been carried out with a series of 107 anti-HIV HEPT compounds with antiviral activity, which was performed by chemometrics methods. Bi-dimensional images were used to calculate some pixels and multivariate image analysis was applied to QSAR modelling of the anti-HIV potential of HEPT analogues by means of multivariate calibration,...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1995